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<N ABSTRACT 
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Some of the cleanest signals for new physics in the early runs of the LHC will involve 
strongly-produced particles which give rise to multiple leptons by undergoing cascade 



decays through weakly-interacting states to stable particles. Some of the most spectac- 
ular final states will involve three or more leptons, multiple jets and generally missing 
energy-momentum as well. A triad of the most interesting models of new physics which 
>^ ■ induce such signals is known to consist of (i) supersymmetry with i?-parity conserva- 

tion, (ii) a universal extra dimension with conservation of KK-parity and (Hi) little 
Higgs models with conserved T-parity. Similar signals could also arise if the Standard 
Model is augmented with a fourth sequential generation of heavy fermions. We study 
all these possibilities and show that a judiciously chosen set of observables, critically 
involving the number of identifiable jets and leptons, can collectively provide distinct 
footprints for each of these models. In fact, simple pairwise correlation of such ob- 
servables can enable unambiguous identification of the underlying model, even with a 
relatively small data sample. 
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1 Introduction 



After some initial glitches, the Large Hadron Collider (LHC) at CERN, Geneva, is expected 
to start colliding protons with protons at the end of the current year (2009). Once the 
calibration stage is over, the start-up centre-of-mass energy has been fixed at 7 TeV and it is 
later expected to increase through 10 TeV to the final goal of 14 TeV p]. Initial luminosity 
targets are somewhat modest - being estimated at 60 pb" 1 in the first year. Though no 
official statements on further progress are available, educated guesses put it at around 1 fb _1 
in about two years of running, and perhaps 5 fb _1 at the end of three years. Thus, an eventual 
goal of 30 (or even 50 fb _1 ) integrated luminosity seems not unattainable, assuming that 
the LHC will run without unscheduled breaks for at least 10 years, with some later upgrade 
in luminosity. At the present juncture, 100 fb _1 of integrated luminosity seems definitely 
unattainable. 

The search for signals of new physics at the LHC, especially in the early runs, will be based 
on three requirements, viz. large cross-sections, clean triggers and distinctive final states. 
Obviously, large cross-sections can be obtained if the underlying process involves strong 
interactions. A very rough estimate of the cross-section for a2->2 process of this kind is 



which, for ^fs ~ 1 TeV, comes out as cr ~ 4 pb. With the luminosity estimates mentioned 
above, this would mean 4 000, 20 000, and at least 120 000 events at the end of 2, 3 and 10 
years respectively. Thus, we shall have fairly copious production of the particles in question. 

Particles which are produced through strong interactions will generally decay through strong 
interactions, unless forbidden to do so by some conservation law. One would, therefore, 
expect the principal decay modes following production of new, strongly-interacting particles 
to produce hadronic final states, generically with a small number of identifiable jets. Such 
signals would probably be completely lost in the huge QCD backgrounds expected at a high 
energy proton-proton machine. For example, the cross-section for dijet production at the 
LHC is of the order of microbarns [2], which implies a few million dijet events in the first 
two years of running. For this reason, in the messy environment of a hadron machine, clean 
triggers are traditionally associated with leptonic final states - with or without large amounts 
of missing energy and momentum. This allows us to classify signals obtainable at the LHC 
into three types, viz. 

1. Final states containing easily-tagged leptons which are produced directly through in- 
teractions of electroweak strength. 
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2. Final states containing easily-tagged leptons which are produced in cascade decays of 
parent particles which are produced through strong interactions. 

3. Final states containing hadronic jets which arise either directly from strong interactions 
or from hadronic decays of parent particles produced in strong interactions. 

The first two types of signals are easy to tag and study, and the last two correspond to large 
cross-sections. This means that the most promising signals correspond to the second type, 
i.e. leptonic final states arising from cascade decays of strongly produced particles. These 
have large cross-sections and are easy to tag as well. However, in order that such signals 
should occur, 

• the 'new physics' must have a very definite kind of mass spectrum, with two classes 
of particles, viz. (i) the strongly-interacting 'new' particles, which should be heavier 
than the (iz) weakly-interacting 'new' particles which decay to leptons, and 

• there should be a conserved quantum number (or nearly-conserved quantum number) 
carried by all the 'new' particles which ensures that the first class of particle does not 
decay directly to strongly-interacting Standard Model (SM) particles, but cascades 
down through particles of the second class. 

These conditions automatically restrict us to a small set of existing models, of which only 
four may be considered popular options. These are: 

1. Supersymmetric models with conservation of R-parity: It has been known for a long 
time that in the constrained minimal supersymmetric model (cMSSM), which is based 
on minimal supergravity (mSUGRA) and universality of parameters at a very high 
scale [3], the strongly interacting sparticles - the squarks and gluinos - though degen- 
erate with the weakly-interacting ones at the gauge unification scale, generally tend to 
become heavier at the TeV scale because of the renormalisation group (RG) evolution 
of their masses. These squarks and gluinos undergo cascade decays down to the lightest 
supersymmetric particle (LSP), which is stable and invisible if i?-parity is conserved, 
producing leptons on the way. Extensive studies of such signals are available in the 
literature [1]. 

2. Universal extra dimension models with conservation of KK-parity: In the UED(5) 
model, which has an extra spatial dimension with the topology of a circle folded about 
one of its diameters, i.e. S < - 1 YZ 2 , the extra dimension can be accessed by all the 
SM fields[5j. The Kaluza-Klein (KK) excitations of all fields which belong to the 
same Kaluza-Klein number (n = 1,2,...) are degenerate at the tree-level, but this 
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degeneracy is split by radiative corrections [6]. Stronger interactions and larger colour 
factors then combine to make the KK excitations of quarks (q n ) and gluons (g n ) heavier 
than the others. These strongly-interacting particles then undergo cascade decays down 
to the lightest KK particle (LKP), which is stable and invisible if KK-parity (—1)™ is 
conserved, producing leptons on the way. Such signals have also been studied[7], though 
not with the same level of detail and sophistication as the corresponding signals from 
sup er symmet ry. 

3. Little Higgs models with conservation of T -parity: Little Higgs models have an extended 
gauge symmetry which is spontaneously broken, yielding a light Higgs boson as a 
Nambu-Goldstone boson, which acquires its mass through small radiative effects. The 
presence of two sets of gauge bosons and scalars, forming irreducible representations 
of two direct product gauge groups, ensures that the dominant terms in the radiative 
corrections to the light Higgs boson cancel, thereby postponing the hierarchy problem 
to a scale of some tens of TeV, which is out of the kinematic reach of the LHC[H]. 
Phenomenological constraints from electroweak precision tests require the introduction 
of a conserved quantum number called T-parity, which protects the other SM particles 
from acquiring unacceptably large masses[9]. The LHC signals of this LH(T) model 
have been studiedpJJ], but nowhere near as comprehensively as in the previous two 
cases. In fact, this model contains heavy T-odd partners of the quarks, which can 
undergo cascade decays down to the lightest T-odd particle (LTP), which is stable and 
invisible if T-parity is conserved, producing leptons on the way. The requirement that 
the stable LTP, a dark matter candidate, be neutral and weakly-interacting, ensures 
that the T-odd quarks will be heavier than at least one other particle, which, in turn, 
ensures that there may be three-body decays with leptons, if not cascades. 

4. A fourth sequential generation with heavy 64 quarks: Though precision electroweak 
tests rule out the presence of a SM with a degenerate fourth generation, it is still 
possiblepT], with a modest amount of fine-tuning, to accommodate a heavy fourth 
generation with non-degenerate SU(2)i partners. All that is needed is a mass pattern 
such that the fourth generation contribution to the S parameter 
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lies within the experimental error. With four unknown parameters, viz. m t4 , m^,™^ 
and m T4 , this is not difficult to arrange. At the same time, if \m ti — m^A is greater than 
about 50 GeV, one gets too large a contribution to the T parameter [IT]. Given such 
a spectrum, and reasonable values of the 4x4 Cabibbo-Kobayashi-Maskawa (CKM) 
matrix elements V ti b ~ V#, 4 ~ 0.1, it is possible for the £ 4 and 64 to decay to final states 
containing two or more leptons. In this case, the approximate conservation of flavour 
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implied by the smallness of the off-diagonal CKM elements ensures that there will 
be cascade decays, not through new particle states, but through heavy flavour states 
(mostly t and b quarks), with leptons as the end product. This SM(4) scenario, the 
simplest of all four options, produces similar-looking signals to the other cases, even 
though it does not offer a dark matter candidate. 

Of the above models, the first three, viz. cMSSM, UED(5) and LH(T) have some attractive 
features in common. The Higgs boson mass is protected from large radiative corrections, 
either by cancellation, or by bringing in new physics at a scale of a few TeV to a few tens 
of TeV, and there exists a stable, weakly-interacting particle (the LSP, LKP or LTP), which 
is an excellent dark matter candidate. The overwhelming observational evidence that dark 
matter exists in the cosmos and is non-baryonic in nature [12], has made it a real challenge 
to find out its nature. Thus, models like the above, which not only solve the decades-old 
hierarchy problem but also provide a simple theory of dark matter, lie at the forefront of the 
hypotheses that we would like to test when the LHC begins to probe the TeV scale. At the 
same time, a fourth generation is a simple and natural extension of the SM, and may offer 
clues to the mystery of the pattern of fermion masses and mixings. Hence, although the 
SM(4) scenario neither proffers a dark matter candidate nor prevents the Higgs boson from 
picking up large self-energy corrections, its very simplicity would make it a natural theme of 
study at the LHC. 

Having established that all of the four models discussed above are interesting in their own 
right and can yield similar-looking multi-lepton event topologies at the LHC, the question 
immediately arises: If some excess over the SM prediction is indeed observed at the LHC 
in the multi-lepton channels, which of these models could be the likely explanation? This 
question is a specific case of the larger 'LHC inverse problem', which addresses the general 
issue of the nature and properties of any new physics which may be discovered at the LHC. 
Experience from the past tells us that if indeed some deviation from the SM is announced, 
it is sure to result in a scramble among particle theorists to prove how this deviation fits 
in with one's favourite model! Quite likely, more than one of the above models will fit the 
observed events for some specific choice - not necessarily unique - of the model parameters. 
However, again experience from the past tells us that the hypothesis which fits one particular 
set of data - say the invariant mass distribution - may not fit a different set of data - say 
the angular distribution - for the same set of events. Only if one chooses the correct model 
should we expect to get a good fit across all kinds of observables. It would be like the case 
of the shoe fitting all the available footprints and leading to an unambiguous identification. 
However, except for some exploratory attempts [13]. the LHC inverse problem has not been 
studied in full seriousness by the high energy physics community. There has been an attitude 
of watch-and-wait with most workers in the field, expecting to actually see a deviation from 
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the SM in the LHC runs and only then to proceed to fit it, as was the practice in the past. 
Against this, it is argued that the LHC and its physics calls for a new approach. Given the 
level of expectation aroused by this machine, the exciting fact that it explores a hitherto- 
untouched energy regime and the more sobering fact that there is no obvious successor of its 
kind, the smallest hint of new physics at the LHC would call for an immediate theoretical 
analysis, with the inverse problem following hard on the heels of any announced deviation. 

Can we devise a quicker method to identify new physics than the traditional x 2 -fitting (which 
would, of course, have to be done eventually)? To be successful, such a method would have 
to be simple and robust and preferably economic in the sense that it should use data which 
will be collected anyway. Keeping this in mind, the present study focusses on final states at 
the LHC which have 

(a) 3 or 4 hard leptons, i.e. 3£ and 4£; 

(b) substantial missing energy fa, and 

(c) an indeterminate number of hadronic jets, i.e. nJ where n = 0, 1, 2, 3, . . .. 

Such states would stand out among the LHC events and would be among the first where 
signals for new physics could be sought. SM backgrounds to such states arise principally 
from WZ or ZZ production, which are necessarily electroweak and hence would have a 
cross-section of the order of a few hundred femtobarns. Given the small branching ratios of 
W and Z to leptons which can be tagged at the LHC, i.e. electrons (e ) and muons (fj, ), 
this means that the background is at the level of only a few events, if at all, for around 
30 fb" 1 worth of data. In the kind of new physics models described above, the probability 
of obtaining the same final states is one - or even two - orders of magnitude greater, which 
means that if something like a few hundred of such events is seen, we shall have a 'smoking 
gun' signal for new physics beyond the SM. These 3£ and 4£ signals are, in fact, much better 
hunting grounds for new physics than signals with one or two final state leptons, because 
the latter have large irreducible SM backgrounds from resonant W and Z production as well 
as ti production. 

Leptonic signals at the LHC with missing energy and with or without hadronic jets, have, of 
course, been studied and described several times in the literature - particularly in the context 
of SUSY searches [E] - and it is well known that they will stand out from the SM background 
rather conspicuously. It is not our purpose, in this work, to reinvent this particular wheel. 
Rather, using the information that these are indeed viable new physics signals, we seek 
to illustrate how the four models described above can lead to similar-looking signals, and 
then to consider ways and means for quickly distinguishing between these models using the 
experimental data which is likely to be available. We reason that each model must leave its 
distinctive imprint in the final states, in the form of kinematics reflecting the mass spectrum, 
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branching ratio structures and angular distributions, as well as jet and lepton multiplicities- 
and our problem is to identify and dig out the correct clues. This is the motivation and 
principal theme of the present work. 

This article is organised as follows. In Section 2, we discuss in more detail how (3£ + MET 
+ nJ) and (A£ + MET + nJ) signals can arise in each of the above models, and what would 
be the distinctive characteristics of events arising from each model. In Section 3, we explain 
our construction of discriminating variables, followed by some educated guesswork on the 
kind of values predicted for the discriminating variables. Section 4 mentions some essential 
features of our numerical analysis and describes our results. Finally, Section 5 comprises a 
summary and critique of our work. 

2 Comparative Anatomy of Multi-Lepton Signals 

In the previous section, we had made a very rough estimate that the cross-section for the 
pair production of two strongly-interacting new particles with masses in the range of a few 
hundred GeV to a TeV would be of the order of a few picobarns. However, even assuming 
that the coupling strengths resemble QCD couplings, the real cross-sections will actually 
depend on several factors, such as (z) the number of new particles being produced, (ii) 
the exact masses of these particles, (Hi) the number of Feynman diagrams contributing to 
the process and possible interference effects, (iv) the actual machine energy and last, but 
not least, [y ) the parton density functions (PDFs) being used for the prediction, especially 
the scale Q 2 at which the PDFs are being evaluated. Apart from this, there will be sub- 
leading effects like initial and final-state radiative corrections, multiparticle interactions as 
well as detector issues such as energy and angular resolution and irreducible effects of pileup 
etc. In the present study, all these minor effects have been neglected and we take only the 
leading order (LO) cross-sections convoluted with PDFs - also at LO. A justification for this 
will be given in the next section. The calculations have been carried out using the event 
generator PYTHIA[15J, cross-checked and/or interfaced with the CalcHEP software[16j. For 
the PDF's, we have used the CTEQ-5 sets[I7] inbuilt in PYTHIA. The advantage of using 
PYTHIA, apart from the fact that it is widely used by theorists and experimentalists alike, 
is that fragmentation and hadronization of partons is taken care of, and a rudimentary jet- 
formation algorithm can be easily implemented using the inbuilt PYCELL routine. More 
details will be presented in the context of individual models of new physics. 

Far more significant than the subleading corrections to the LO calculation of the cross- section 
is the fact that there is still some uncertainty as to the actual machine energy at which the 
experimental data will be collected. It appears to be quite reasonable to hope that this 
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will soon rise above the tentative start-up value of 7 TeV, but caution must be exercised 
in assuming that the energy will thereupon be speedily upgraded to the eventual goal of 
14 TeV. This may, in fact, take several years, and hence, an intermediate energy value of 
10 TeV seems to be the favoured choice for LHC studies at the present juncture. This has 
been followed in the present work. It is, however, entirely possible that the early performance 
of the LHC will be so satisfactory that the energy does gets upgraded to 14 TeV quite soon. 
In that case, we have checked that there will be little qualitative difference in our results, 
though actual numbers will, naturally, be different. 
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Figure 1 : Cross-sections for strong pair-production of new particles at the LHC as a function of the machine 
energy. The thick lines represent the total cross-section in each model. Plotted in the boxes, as marked, are 
(a) the cMSSM with Aq = 0, fi > 0, and other parameters as marked, (b) the LH(T) model with f = 1 TeV 
and the SM(4) with 771(64) — 550 GeV, and (c) the UED(5) model. Note that q stands generically for all 
quarks and their heavy counterparts. 



A rough understanding of how our numerical results will change as the machine energy 
varies from 7 TeV through 10 TeV to 14 TeV can be obtained by inspecting the graphs in 
Figure [TJ These show, for an illustrative choice of a single point in the parameter space of 
each of the four models mentioned in the last section, the variation of the cross-section for 
the production of the main strongly-interacting particles as the machine energy y'i varies 
from 7-14 TeV. The box on the left, marked (a), shows the supersymmetric cross-sections, 
with squark production, in association with an anti-squark or a gluino, dominating the gluino 
pair production. The squark and the gluino are around 600 GeV and 650 GeV, respectively, 
in this plot. This makes them much lighter than the machine energy and indicates that a 
large part of the cross-section comes from initial-state partons with very small momentum 
fractions. The steep rise in PDFs, especially gluon PDFs, for small arguments, is well- 
known and accounts for the fact that these cross-sections, despite being proportional to 
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Oig/s, actually grow with energy ^/s. 

The central box, marked (b) shows cross-sections for the pair production of heavy quarks 
in the LH(T) and SM(4) models. In the LH(T) model the cross-section is for the heavy 
T-odd partners of the generic quark q, which are taken to have a common mass around 
600 GeV, while in the SM(4), the isospin — | component 64 is assumed to have a mass 
around 550 GeV0. Finally, in the box marked (c) on the right, we have plotted cross-sections 
for the UED(5) model, where the particle spectrum lies in the vicinity of around 500 GeV. 
Although these cross-sections also vary quite dramatically with the choice of parameters, 
phase space considerations are rather similar for all the plots in Figure HJ and hence the 
apparent fact that for similar particle masses, the UED cross-sections tend to be larger than 
those in other models, happens to be true for most choices of parameters. 

The curves shown in Figure [1] are solely intended to illustrate the general trend for heavy 
particle production through strong interactions as the centre-of-mass energy ^/s of the LHC 
increases. The absolute values of the cross- sect ions in the three boxes vary widely depending 
on the parameter space. One should not, therefore, jump to conclusions such as imagining 
the SM(4) cross-sections to be generically smaller than the LH(T) cross-sections, which 
seems to be indicated by the central box (6) - this is by no means true for other choices of 
parameters. However, it is reasonable to draw the conclusions that 

• at y/s = 10 TeV, the overall cross-sections for strong production of new, heavy particles 
lies in the wide range of about 100 fb to about 10 pb; and 

• the variation in cross-section as one proceeds from 7 to 14 TeV is about one order of 
magnitude. 

The first conclusion can be used to reinforce the contention made in the previous section 
that these new particles will be produced in huge numbers at the LHC, even with revised 
luminosity estimates. The second can be used to get a (very rough) picture of how our 
results will change as the machine energy increases. 

Once such heavy new particles are produced in proton-proton collisions, they will undergo 
cascade decays as permitted by the conservation laws in the theory, leading, as we have al- 
ready asserted, to multi-lepton signals. These multi-lepton signals accompanied by hadronic 
jets and large amounts of missing transverse energy (MET) are best understood with refer- 
ence to the decay channels depicted in Figure [2] These are now described in a model-wise 
manner. 

1 The cross-section for t^t^ pairs is similar, since the mass-splitting between 64 and t 4 cannot be much 
greater than about 50 GeV, but this cross-section has not been shown here since it is not relevant for our 
analysis. 
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Fi glire 2: Major cascade decay chains leading to multilepton signals with missing energy in pp collisions 
at the LHC, in the four models under consideration. Final states are highlighted by encircling in red for jets 
and green for leptons. Other stable final states are invisible and lead to MET. The cascading process may 
also start from the state immediately to the right of the vertical (broken) line(s). In each box, the upper chain 
leads to one lepton, jet(s) and MET, while the lower chain leads to two leptons, jet(s) and MET. 

Cascade decays in the cMSSM : Depicted in the box marked 'cMSSM', on the upper left side 
of Figure [21 these cascade decays start from the gluino g, though it is also possible to start on 
the right of the vertical broken line from the squark q if that is the produced particle. Be it 
a gluino or a squark, two kinds of cascade are depicted in the figure. The decay of the gluino 
to the squark(s) proceeds through the dominant g —> q + q channel(s) with branching ratio 
unity, but the squark(s) may decay in various ways. If they decay to chargino Xi states, 
the cascade decay follows the upper chain depicted in the figure, culminating in a final state 
with one lepton, jets and MET from the Vt and the LSP Xi- The chargino Xi itself may 
decay through two major channels, depending on its composition in terms of gaugino and 
Higgsino states. For example, if the chargino xt is purely a Wino W ± and the LSP Xi 1S 
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purely a Bino B°, then the SU(2)l <8> U(l)y electroweak symmetry precludes the existence 
of a Xi~X^rW ± vertex, and hence one of the cascade decays depicted in the figure will not 
occur. However, since the final state is the same (£ ± + MET) the char gino branching ratio 
to one lepton is effectively unity 



It is also possible for the squark(s 



to decay into one of the neutralino states x° {i = 1, 2, 3, 4). 
If it decays directly to the LSF^I, i.e. q — > q + x?, then, although the channel will be 



.15 

kinematically favoured, no leptons will be generated. On the other hand it is also possible 
for the squarks to decay to a heavier neutralino states, such as the x° depicted in the figure, 
or even the heavy states xt or Xa- These branching ratios will be very much dependent on 
the choice of parameters. However, cascade decays of a heavy neutralino will lead - again 
through two possible decay chains as depicted - to a final state with £ + £~~ and MET. 

The two major decay chains above result in final states with either £^ + MET + jets, or 
£ + £~ + MET + jets. There are other, more exotic, possibilities. For example, we can have 
a decay chain of the form 

Q — > xf + • • • — > Xi + • • • — > xt + • • • — > X? + • • ■ 

where one can get one lepton from each of the chargino <-> neutralino decays, i.e. three 
leptons in all. However, the nature of the mass spectrum in the cMSSM makes it difficult to 
have other possibilities, since the xi an d the xf are practically degenerate, as are the xt an d 
the xf. Even the four-step decay chain shown here is kinematically disfavoured except in 
some small patches of the parameter space and may be considered a rare process even there. 
We have chosen, therefore, to concentrate mostly on the decay chains depicted in Figure EJ 

Once we decide to concentrate on the one-lepton chain £ + MET + jets and the two-lepton 
chain £ + £~ + MET + jets, there are three possibilities after a production of a pair of the 
strongly-interacting particles g and/or q, viz., 

• both decay through the one-lepton chain, so that the final state is (2£ + MET + jets); 

• one decays through the one-lepton chain while the other decays through the two-lepton 
chain, so that the final state is (3£ + MET + jets); 

• both decay through the two-lepton chain, so that the final state is (4£ + MET + jets). 

The first option, which has a substantial background from Drell-Yan production of leptons 
with some MET which can be due to various causesi is not the focus of our attention in this 



2 We note here that in the cMSSM, the lightest neutralino Xi is always the LSP in the parameter space 
allowed by experimental constraints. 

3 e.g. radiation of a Z which decays invisibly to neutrinos, or radiation of a number of soft gluons which 
form soft, undetected jets. 
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work. It is the second and third options, i.e. the (3£ + MET + jets) and the (4£ + MET + 
jets) signals which interest us, as their SM backgrounds are negligible. The jet multiplicity 
will depend on the initial state as well as the leptonic content, apart from possibilities 
for jet splitting and jet merging. However, we can broadly conclude that the energy of 
the jets will be controlled by the splitting between M(g) and M(q) and between M(q) and 
M(xf) ~ ^(x«D- To quantify this, we define a variable Ami = sup{M(g), M (q)} — M (xf)} ■ 
Similarly, the energy of the emergent lepton(s) will be controlled by the mass difference 
Am 2 = M(xf) -M(x°i)- A scatter plot of these mass differences is shown in Figure El where 
the red dots represent randomly chosen points in the cMSSM parameter space which are 
allowed by all laboratory constraints, including those from LEP-2, radiative -B-decays, etc. 
The strong lower bound on Ami is mostly controlled by the direct bound on the chargino 
mass from LEP-2, while the upper cut-off (near the top of the box) has been imposed by 
demanding that the cross-section for coloured sparticle production at the LHC should not 
fall below 1 fb at -^i = 10 TeV. It is immediately obvious that the cMSSM spectrum in 
the allowed parameter space contains heavy gluino and squark states with masses around a 
TeV or more, while the gaugino states are relatively light. This accounts for the fact that 
Ami is generically above 500 GeV, while Am 2 is generically below 600 GeV. It follows that 
jet energies in the range of many hundreds of GeV's are possible in the cMSSM, while the 
leptons will tend to be softer, being most likely around 200 - 300 GeV. The very hard jets 
in this case will have ample energy to split into more jets and give a high jet multiplicity, 
and usually the leptons will all be well above any reasonable detection thresholds we may 
choose to apply, as a result of which they will be easily detected. 

Cascade decays in the UED(5) model : It is not for nothing that the UED(5) model has 

been dubbed 'bosonic supersymmetry'[7]. The n = 1 particle spectrum and decay chains 

in this model resemble supersymmetry very closely. Yet there are significant differences, as 

the depiction in Figure [2] shows. The upper right-hand box shows how a gi KK excitation 

can decay through a q\ excitation into either a W\ or a Z\ KK excitation, whose further 

decays then yield one lepton or two leptons respectively, with the stable LKP, the B\ (often 

denoted 71) escaping detection and contributing to MET. It should be noted at this point 

that in the UED(5) model, the n = 1 counterpart of the Weinberg angle is very small [6 J 

and hence one may say that the LKP 71 is practically all Bi, while the orthogonal state Z\ 
(3) 

is practically all W{ . This represents a rather different situation from the cMSSM, where 
the LSP xh is almost always a mixture of the B and the W^ 3 \ except in some very small 
patches of the parameter space. The dominantly U(l)y nature of the LKP in the UED(5) 
model means that vertices like W^W^ji « W^W T B 1 are absent, unlike their cMSSM 
counterparts. At the same time, there is no analogue of the cMSSM process x 2 ~ > Xi + Z° 
which is a supersymmetrization of the ZZH vertex in the SM. This is because the LKP, the 
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analogue of the x1, is almost wholly a 
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Fi gure 3: Illustrating mass difference patterns responsible for three of the four models under consideration. 
The variables plotted along the axes are as follows: 

• cMSSM: Am! = sup{M(g) - M{xf),M(q) - M(xf )} and Am 2 = M(xt) ~ M (Xi) 

• UED(5): Ami = sup{M( 5 i) - M (Wf), M{q 1 ) ~ M(W^)} and Am 2 = M (Wf) - Af(B°) 

• LH(T): Ami = M(q T ) - M{W$) and Am 2 = M(W^) ~ M{B%) 

Here, Ami represents an upper bound for jet energies and Am 2 represents an upper bound for lepton energies. 

Even lacking two of the important decay chains which are present in the cMSSM, it is 
still possible for the UED(5) to predict final states with (3£ + MET + jets) and (4£ + 
MET + jets) through the decay chains depicted in Figure [2l i.e. through an intermediate 
lepton number-carrying n = 1 state, which is analogous to the slepton or the sneutrino state 
depicted in the cMSSM. However, we now note that there exists another important difference 
between the cMSSM and the UED(5) model. In the cMSSM, all fermion masses are equal 
(to mi/2) at some high scale around 10 16-17 GeV and, likewise, all boson masses are equal (to 
Too) at the same high scale. Radiative corrections - which depend crucially on the various 
couplings and conserved quantum numbers carried by these states - ensure that these masses 
have very different running properties when calculated at the energy scale of the LHC, using 
renormalisation group (RG) equations. So widely do these vary that the mass spectrum at 
the LHC scale of around a TeV permits mass splittings as large as a TeV (Figure [3] makes 
this quite explicit.). In the UED(5) model, however, effects of compactification of the higher 
dimension are observed at the LHC scale itself, where all KK excitations of level n will 
have a common tree- level mass ni? _1 , where R is the radius of compactification. As in the 
cMSSM, this degenerate mass spectrum is split by radiative corrections, but this time since 
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everything happens at the TeV scale, the radiative corrections remain small, as is only to 
be expected from perturbative effects. There is no RG running here to blow up the small 
mass-splittings into huge differences. 

What the above argument means in practice is that the overall mass difference between the 
heavy g\ and the LKP B\ is not very large. In fact, a good rule of thumb is to take this 
difference as around 15% of the generic n = 1 mass Rr 1 . As particles which are much heavier 
than about 1.5 TeV would have small production cross-sections at the LHC, the mass splitting 
between n = 1 states cannot be much more than about 225 GeV. The mass splittings between 
gi and qi, between qi and W\ and between W\ and Bi « ji must be less than this. Since this 
mass splitting has to account for the transverse energy of the final state jets and leptons, as 
well as the MET, it is clear that none of these quantities can be expected to peak anywhere 
above 100 GeV, though the occasional statistical fluctuation is, of course, always possible. 
The near-degenerate nature of the UED(5) spectrum is nicely illustrated in Figure[3j Here, as 
before, we define a pair of mass- splittings Ami = sup{M(#i) - M(W^), M(q 1 ) - M(W 1 ± )} 
and Arri2 = M(Wi) — M{B\) and show a scatter plot of randomly chosen points in the 
UED(5) parameter space. These are represented by the green dots in Figure El It is clear 
that these splitting are much smaller than the splittings in the cMSSM, as a result of which 
the green dots form a narrow sliver in the Am r Am 2 plane, which does not overlap the 
cMSSM region at all. 

Before passing on to the next model, it is important to discuss the KK modes for n > 2. 
Since the lower bound on R~ x can be as low as some 300-400 GeV[T8], it is easily possible to 
produce the n = 2 KK states (which have even KK-parity) as resonances. The mass of such 
a resonance will be roughly 2i? _1 , which is the same as that of a pair of n = 1 KK states 
having masses each. On the other hand n = 3 resonances, which have odd KK-parity, 
have to be pair-produced, which requires a centre-of-mass energy of around 6R~ l , which 
puts it in the vicinity of 2 TeV or higher. The sharp falling-off of PDFs at such high energies 
(especially if LHC is unable to reach the design goal of 14 TeV) ensures that cross-sections for 
pair-production of n = 3 states will be too small to merit further discussion. However, n = 2 
resonances have been discussed in the literature [19] as a direct test of the UED(5) model 
vis-a-vis supersymmetry Q. In this work, however, we find that it is possible to distinguish 
the UED(5) signals from the cMSSM ones considering the n = 1 states alone, and hence we 
do not pursue the issue of higher n states any further. Such signals, if found, may be taken 
as complementary to the results and arguments presented in this article. 

Cascade decays in the LH(T) model : Little Higgs models [8] have found their own niche 

4 which, being N — 1 supersymmetry in order to have chiral fermions, has no analogue of the higher n 
KK excitations. 
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in the lore of particle physics as they provide a solution to the hierarchy problem which is 
intermediate between that provided by supersymmetry and brane world models with either 
large, flat or small, warped extra dimensions. The introduction of two complementary gauge 
groups 577(2)i x U(l)i and 577(2) 2 x £7(1)2 which are diagonally broken to the SM gauge 
group SU(2) l x U(l)y enables us to cleverly assign quantum numbers to bosons and fermions 
transforming under these in such a way as to cancel the leading quadratic contributions to 
the Higgs boson self-energjjE To keep the extra fermions from making unacceptably large 
contributions to electroweak corrections to the W and Z boson masses, direct couplings 
have to be forbidden by the introduction [9 J of a conserved quantum number called T-parity. 
It turns out that in these LH(T) models, all SM particles P participating in electroweak 
interactions have T = +1 and possess a T = — 1 partner P?. Note that there is no T-odd 
partner for gluons, since they do not participate in electroweak interactions. The T-odd 
partners are generally quite heavy, satisfying [20] 

M(W T )=gf M (B T ) = 9 -^= tM(g) T ] y = «y/ (3) 

where / is a common mass scale (around a TeV) and the Kij form a matrix of Yukawa 
couplings. Keeping the T-odd quarks qx (and, separately, T-odd leptons £?) degenerate helps 
avoid large FCNC effects through a GIM mechanism. The mass splitting between the Wt 
and the B>t works out to about 2.75 M{Bt). Given that the electroweak precision bounds [20] 
on LH(T) models lead to M(B T ) > 80 GeV, it follows that M(W T ) - M(B T ) > 220 GeV, 
which is quite substantial. 

It is apparent from Eqn. ([3]) that the mass spectrum in the LH(T) model is not so tightly 
determined as is the case in the cMSSM and in the UED(5) models. The base parameters 
in the theory are the scale /, the common mass M{qr) and the common It mass M(£j-), 
which can be varied at will. Not every mass scenario in the model, therefore, leads to 3£ and 
47 signals. The ones which do so are compatible with the sketch shown in Figure [2] (box on 
the second row, to the left, marked LH(T)) and have a hierarchy 

M(q T ) > M(W T ), M(Z t ) > M(£ T ) > M(B T ) (4) 

which leads to cascade decays with multi-lepton states. This is not to say that the other 
possibilities (e.g. having the It heavier than the Wt or Zt) are considered in any way less 
favoured. It is simply that such scenarios will not lead to multi-lepton scenarios, and hence 
are not relevant for the inverse problem in the limited sense under consideration. For the 
mass pattern which does give multi-lepton signals, however, Figure [2] shows that the decay 

5 However, this cancellation occurs only to first order, so that the hierarchy problem reappears at higher 
orders. The model is thus able to postpone the hierarchy problem to a higher scale of around 10 TeV, which 
is beyond the accessible energy of the LHC, but not to remove it altogether. 
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pattern for the LH(T) model is very similar to the pattern depicted for the UED(5) model 
to the right of the vertical (broken) line. 

On the whole, the LH(T) scenario may be expected to lead to somewhat smaller cross- 
sections than the cMSSM or the UED(5) because of the absence of any counterpart of 
the g or the gi, and hence, no counterpart of the glj and qg (or g x g\ and q\g\) processes. 
This, by itself, cannot be used as a distinguishing feature, of course, since the LH(T) cross- 
section for a smaller mass range is comparable to the cMSSM or UED(5) cross-section for 
a larger mass range. However, it can be used in conjunction with other parameters, as will 
be seen in the final section. Of more immediate interest is the fact that the mass splittings 
Ami = M(qx) — M(Wt) and Am2 = M(Wt) — M(Bt), which govern the energy of emergent 
jets and leptons respectively, are neither constrained to be as small as in the UED(5) case, 
not required to be as large as in the cMSSM. These intermediate values are plotted as a 
scatter plot in Figure [3j where it is apparent that, except for a small region of overlap with 
the cMSSM region, the LH(T) model largely occupies a different region in the Ami — Am2 
plane. This difference can be effectively exploited in building discriminators, as we show in 
the next section. 

Cascade decays in the SM (4) : As mentioned in the Introduction a sequential fourth gener- 
ation in the SM is still a possibility if there is a modest mass-splitting in the quark sector 
which is matched by a corresponding mass-splitting in the lepton sector, in accordance with 
Equation ([2]). In fact, balancing the constraints from the S and the T parameters, which 
tend to work in opposite directions, one concludes that the mass splitting between and the 
64 can be anything in the range from 0-55 GeV at 95 % CL. Constraints coming from current 
data on B°-B° mixing and Z — > bb are not incompatible with this level of mass-splitting[21j. 

In the presence of a fourth generation the 3x3 Cabibbo-Kobayashi-Maskawa (CKM) matrix 
V requires to be extended to a 4 x 4 matrix, where the mixing elements with the first two 
generations are constrained by unitarity and other considerations to be practically zero. 
There can be more substantial mixing between the third and fourth generations^ with the 
only constraint on V34 coming from from the unitarity of the 4x4 mixing matrix. This 
forces V44 to be close to 1, and V34 < 0.15 at 95 % CL. For values which saturate the latter 
bound, both the £4 and 64 will dominantly decay through this mixing to two-body final states 
through i 4 — > b + W + and 64 — > t + W~, since the CKM-favoured decays £ 4 — > 64 + W + or 
&4 — > £4 + W~ are kinematically impossible when |M(t 4 ) — M(6 4 )| < 55 GeV < M w , and can 
only be realised in three-body decays with a heavy VT-propagator. Of course, the precise 
values of | V34 1 ~ | V43 1 is not important for collider studies. They will only affect the lifetimes 

6 Which is consistent with the hierarchy of mixing observed in the first three generations, viz. neighbouring 
generations mix more. 
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of the fourth-generation quarks and not their decay patterns, since the two-body decay will 
dominate over the three-body one unless this element | V34 1 ~ | V43 1 assumes an unnaturally 
small value of 10~ 4 or less. 

Once we realise that the t 4 and 64 quarks will decay into ordinary b and t quarks, it is clear 
that direct searches at Tevatron can put stringent limit on their masses. In fact, such limits 
do exist [22J, and are about M(£ 4 ) > 325 GeV and about M(6 4 ) > 355 GeV, assuming that 
they decay dominantly to the third generation quarks. There is no upper limit on the masses 
of the £4 and 64, except for the quantum field theoretical constraint that if they are more 
massive than about 600 GeV, the corresponding Yukawa couplings would be large and would 
then quickly hit the Landau pole[llj. 

The presence of a fourth generation of quarks with masses in the range 350-600 GeV could 
easily act as a spoiler for new physics signals in the trilepton and four-lepton channels. The 
box on the lower right side of Figure [2] shows the decay chains which are responsible for 
this. As in the other boxes, the upper chain gives rise to a £ + MET + jets signal, while the 
lower chain gives rise to a £ + £~ + MET + jets signal. We notice that both these chains arise 
from the decay of a 64 quark. If we produce a £4 quark, it will always decay to a 6-quark 
and a VT-boson, producing not more than a single lepton when the VT-boson decays^. Thus, 
the decay of a £ 4 i~4-pair will lead at most to a 2£ + MET + jets signal, which is not under 
consideration in this work. Thus, the i 4 has no role to play in the subsequent discussion and 
will not be considered any further. 

Being a minimal extension of the SM, the SM(4) has no counterpart of the LSP (x?), the 
LKP (Bi) or LTP (Bt)- Hence the only mass-splitting that is relevant is M(b±) — writ, which 
is simply linear in M(& 4 ), varying from about 180-425 GeV. However, this mass difference 
does not represent the energy of any jet or lepton per se, but is shared among five light 
particles according to the kinematic configuration. Accordingly, we do not show any scatter 
plot in Figure [3] corresponding to the SM(4). It turns out, however, that the very lack 
of a characteristic spectrum for the SM(4) makes it difficult - though not impossible - to 
disentangle from the other models. This will be discussed in the following sections. 

Though we concentrate on the four models described above, trilepton and four-lepton sig- 
nals are also possible in some other models. The obvious ones are extensions or variations of 
the above models, such as the many possibilities with supersymmetry[23] - non- universal 
masses, focus-point evolution of masses, even the unconstrained MSSM, as well as the possi- 
bility of i?-parity violation through a LLE coupling. Other possibilities include the extension 

7 Of course, a small number of isolated leptons will also arise when the 6-quark decays semi-leptonically, 
but this can be treated as a sub-leading effect. 
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of the UED(5) model to 6 dimensions |24j. or a composite model with appropriate couplings. 
In fact, it is always possible to build up a model with suitably chosen fields and interactions 
which would produce the given signals. However, such ad hoc creations would lack the mo- 
tivational advantages of the four scenarios discussed in this work. Hence, we have not made 
any attempt to study the (3£ + MET + jets) and (4£ + MET + jets) signals in any models 
other than the four enlisted above. 

It is necessary to add a caveat to the previous paragraph. There is always a possibility 
that when the LHC data become available, we shall see signals in the (3£ + MET + jets) 
and (4£ + MET + jets) channels which do not match in kinematic and other profiles with 
any of the four models considered in this article. In that case, it is precisely the kind of 
'exotic' possibilities mentioned in the last paragraph, to which we would require to turn for 
an explanation. Our purpose is not, therefore, to belittle these alternative models or to be 
dismissive of their relevance, but simply to note that the present study - a first of its kind 
- is necessarily limited in scope, and does not take all these models into account. 

3 Model Discriminators 

At this point we expect that the reader will have been convinced that (a) different models, 
such as the cMSSM, UED(5), LH(T) and SM(4), and perhaps a few others as well, will 
indeed, contribute to new physics signals in the (3£ + MET + jets) and the (4£ + MET 
+ jets) channels, and that (b) the kinematic footprint of different models are likely to be 
different, given the different mass patterns shown in Figure [3j However, in order to be 
precise, we must develop (i) numerical measures of these footprints, and {ii) a systematic 
way of studying them, which would lead to economic and efficient ways of discriminating 
between different models of new physics. These are the goals of the present discussion. 

At this point, it is a good idea to quickly take stock of the main tools one can use for 
kinematic analysis. These are really of three kinds, viz. 

• kinematic distributions, i.e. differential cross-sections in kinematic variables; 

• cross-sections after putting kinematic cuts, which are the same thing in another avatar; 

• multiplicity count, i.e. counting the number of jets, leptons, etc. which emerge after 
putting appropriate kinematic cuts. 

The maximum amount of information will, of course, lie in the first option, but this is 
inconvenient when scanning over the parameter space of an underlying theory. Of course, 
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once there exists a kinematic distribution from the experimental data, one can treat that as 
a standard and check theoretical predictions against it - using some fitting procedure such 
a maximum likelihood fit or a Bayesian analysis. Again, if there is a substantial SM cross- 
section, we can perhaps treat that as a standard and check theoretical predictions against 
it. For the trilepton and four-lepton signals, however, we have neither, and hence, there is 
no single numerical index to tell us whether a given theoretical distribution is good or bad. 
It is more convenient, therefore, to use the second and third options, in which every point 
in the parameter space maps into a single number (the cross-section) or a few numbers (the 
multiplicities). If we plot two such numbers along the axes of a 'signature space', the variation 
of parameters will generate a region in the 'signature space'. Overlapping regions would then 
correspond to non-distinguishability of models; non-overlapping regions will mean that the 
models are distinguishable. 

To make the general discussion above more concrete, in Figure HI we have plotted some 
kinematic distributions (normalized to unity) in the upper half and some multiplicity counts 
(normalized to unity) in the lower half, for a (3£ + MET + jets) signal at the LHC, running 
at 10 TeV. The upper boxes show the transverse momentum p? of the three leptons (ordered 
according to their p?) in the case of the cMSSM, SM(4) and the LH(T) models respectively, 
reading from left to right. Parameters in each of the models have been chosen arbitrarily in 
order to show that the lepton px distribution can look quite similar in all the models. There 
are minor differences between the graphs shown in Figure HI but after detector smearing 
effects are taken into consideration, these are sure to be quite indistinguishable. A mere 
study of the pt spectrum of the leptons will not, therefore be able to discriminate between 
these three models. 

For transparency, in Figure HI the parameters for each model are: 
cMSSM : m = 150 GeV, m 1/2 = 650 GeV, A = 0, tan/5 = 10, [i > 

SM(4) : M(6 4 ) = 600 GeV 

LH(T) : M{W T ) = 650 GeV, M(B T ) = 150 GeV, M(£ T ) = 560 GeV, M(q T ) = 1 TeV 

However, there are many points in parameter space where similar distributions between 
models may be obtained, so this particular choice of parameters has no more significance 
than the fact that it serves to illustrate the argument. We have not shown the UED(5) 
case in Figure HI because there the leptons will tend to be much softer than the ones shown 
in the figure. However, that does not mean that the UED(5) model is distinguishable by 
these alone, since soft leptons can arise in some of the other models as well, for appropriate 
parameter choices. 
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Fi gure 4: Sample kinematic distributions for one point in the parameter space (see text) of each of the three 
models cMSSM, SM(4) and LH(T). The three upper boxes show the pt spectrum for the leptons in a trilepton 
signal, with the red, green and blue histograms corresponding to the three leptons in order of hardness. The 
three lower boxes, show, for the same choices of parameters, the jet multiplicity in each of these models.. 



The three boxes in the lower half of Figure 0] show the jet multiplicity in each of the three 
models as indicated, for the same choice of parameters as in the upper boxes. For the 
record, these jets have been generated using PYTHIA, with appropriate hadronization and 
fragmentation as implicit in that software, and then collected into jets using the inbuilt toy 
calorimeter subroutine PYCELL. Though this is not a very sophisticated jet-making tool, 
it is known to be tolerably accurate, and in any case, a glance at the figure will show that 
smearing or no smearing, these multiplicity distributions are vastly different in all the three 
cases. Physically, this is not so difficult to understand: 

• For the cMSSM, the (3£ + MET + jets) signal arises when there is a chargino-mediated 
decay on one side and a neutralino mediated decay on the other side. The dominance 
of the qq* production mode is reflected in the fact that the maximum number of events 
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have 2 jets. The few single jet events can be variously attributed to loss of a jet through 
merging, extreme softness, passage down the beam pipe or excessive scatter of hadronic 
clusters. The next-most dominant process is qg production, which leads to 3 jets, and 
this is indeed the next highest bar in the histogram. Finally gg production, which is 
sub-dominant, provides most of the 4 jet events. Higher multiplicities arise when the 
W ± bosons arising from X1/X2 cascades decay hadronically. 

• For the SM(4), three-leptons are found when three of the W bosons in the chain decay 
leptonically, and one decays hadronically, i.e. mostly into two jets. Taken together 
with the two 6-jets produced at the first step in the cascade, one would expect all 
trilepton events to be accompanied by 4 jets. Indeed the largest number of events do 
have 4 jets, but many of these merge or are otherwise lost to give substantial numbers 
of events with 2 and 3 jets. Single jet events are comparatively rare, as is only to be 
expected, since that would involve loss or merging of 3 of the 4 jets. 

• For the LH(T), on the other hand, there are only two hard jets which arise from the 
decay of the parent qx (or qx). There is a little bit of merging, etc., but hardly any 
fragmentation effects to speak of. 

These jet multiplicities immediately indicate to us how to go about defining a discriminator 
variable. Let us consider the jet multiplicity distributions of Figure H] and define 

^ number of trilepton events with 3 or more jets 
number of trilepton events with 2 jets or less 

A quick glance at the three boxes in the lower half of Figure H] shows that D will be around 
unity for the cMSSM, much larger than unity for the SM(4) and very small for the LH(T). 
The fractional values given in the figure can be converted into numbers by multiplying by 
the total number of events (which is going to cancel in the ratio anyway). The exact figures 
are 0.9 for the cMSSM, 7.9 for the SM(4) and 0.01 for the LH(T), and they span almost three 
orders of magnitude. Unfortunately, this wide separation between values of D calculated in 
the three models is not always the case as we scan over the parameter space. For example, in 
LH(T) models, as the mass of the q? is brought down from 1 TeV to close to the Wt mass, 
the jets produced in q? —>■ Wt decay will tend to get softer and will fragment more. In this 
case, the value of D calculated in the LH(T) model will increase substantially. It may not 
overlap with the SM(4), but it can be expected to overlap with cMSSM events in some part 
of the cMSSM parameter space where the qq* dominates the others. 

Nevertheless, D is a good and robust discriminator, because of several reasons, viz. 

1. Since it essentially a ratio of cross-sections its value is independent of many multiplica- 
tive factors, such as luminosity, overall coupling constants, etc. 
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2. Since it involves two parts of the same cross-section, the major part of the PDF de- 
pendence also cancels. However, since the identification of jets and the jet formation 
algorithm depends on the kinemetics, the jet multiplicity may depend marginally on 
the colliding parton momentum fractions x\ and x-i- This may in turn, induce a small, 
sub-leading, PDF dependence. 

3. For the same reasons as above, the dominant parts of the QCD and electroweak cor- 
rections as well as factors due to multi-particle interactions, initial and final state 
radiation and detector efficiencies also cancel out. 

The robustness of D is, however, affected by the requirement that in calculating the ex- 
perimental value of D, the denominator should not be zero or even very small. To have a 
reasonable number in the denominator, the overall cross-section and luminosity again come 
into play, and through these, all the uncertain factors mentioned above. In this work, we 
have chosen to calculate D and all D-\ike ratios only if both numerator and denominator are 
> 5, for a given luminosity benchmark. Thus, the parameter space accessible through D and 
D-\ike variables gradually increases as the luminosity increases. This will become apparent 
in the next section, where we discuss our results. 

We must also take note of the fact that the information which enables us to discriminate 
between models is not entirely enshrined in the form of cross-section ratios like D above. 
We can also use, with great profit, the actual magnitudes of the cross-section. For example, 
the production of a g\g\ pair yields a much greater cross-section than a gg pair for the same 
mass range. As the coupling is just a strong coupling, this is a consequence of the different 
spin of the g\ as opposed to the g. Again, if we consider the LH(T) model, in general, 
there is no counterpart to the gg and gq (or g i g 1 and giqi) processes, so one would expect 
a smaller cross-section for the same mass range of the heavy q?- Of course, the small cross- 
section in the LH(T) model for light qr production can be mimicked by small cross-sections 
in the cMSSM or UED(5) for heavy particle production. However, in the latter case, one 
expects to see larger mass splittings in the spectrum, which will show up in an appropriately- 
constructed ratio. Thus, combinations of a cross-section and a ratio may also lead to useful 
discriminators. However, when using a cross-section value as a discriminator, one has to 
realise that this is subject to variation due to all the effects mentioned above as cancelling 
for the D parameter. Thus, only wide separations in cross-section should be taken seriously 
for discrimination purposes. 

Taking all the above considerations into account, and reflecting on the nature of the signals 
in the models under consideration, we have constructed a set of eight variables of event- 
counting nature, which, we believe, would retain the information required to discriminate 
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between models at the LHC. These variable are listed in Table 1 below. Instead of using 
cross-sections, we have used event numbers assuming a given integrated luminosity £ (which 
has been taken at different benchmark values in our numerical analysis). To define these 
event counting-type variables, we have used specific hardness criteria for both leptons and 
jets. These are 

• A lepton will be considered hard if its transverse momentum satisfies p l T > 50 GeV ; 

• A jet will be considered hard if its transverse momentum satisfies > 150 GeV. 

The rationale for these hardness criteria may be obtained by glancing at Figure [3] and noting 
that these would exclude most of the leptons and jets formed in the UED(5) model, for a 
large part of the parameter space. The other models depicted in the figure are not affected 
so strongly. 

Variable Definition 

number of events with n hard leptons and < 2 identifiable jets 
number of events with n hard leptons and > 3 identifiable jets 
Nq number of events with n leptons and no hard jet 

Ni number of events with n leptons and > 1 hard jet 

v number of events with > 1 hard leptons and no hard jets 

z/o number of events with no hard lepton and no hard jets 

v' number of events with > 1 hard lepton and > 1 hard jet 

z/q number of events with > 1 lepton and > 1 hard jet 

Table 1: Event counting-type discriminators for 3£ and Ai signals at the LHC, i.e. n — 3,4. 

In terms of the event-counting discriminators listed in Table 1, we are now in a position to 
define six ratio-type discriminators. These are 

iV 2 (n) v 

for n = 3,4. All of these share the robustness properties defined for 'D' above, but perform 
different functions so far as distinguishing between models is concerned. This is now discussed 
below, where we set out our naive expectations for each of the models under consideration. 
The reason these are called 'naive' is because we do not know a priori how many extra jets 
will be formed by fragmentation, how many leptons will excite the trigger even though they 
come from semi-leptonic decays of c and b quarks in the jets, and such things. It is true that 
these effects are sub-leading in nature, but even so, one cannot completely neglect them, as 
the discrepancy between our naive expectations and the actual results obtained on running 
Monte Carlo simulations will presently show. 
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As an illustration of the discriminating power of these ratio-type variables, let us consider 
the variable D 3 , which is defined in Equation j6| as D3 = /N^\ This is equivalent 
to taking the jet multiplicity distribution corresponding to a trilepton signal, as shown in 
Figure H] and partitioning it into two parts - one with 0-2 jets, and one with 3 or more 
jets. The ratio D 3 of these partitioned cross-sections is, in fact, identical to the variable C D' 
defined in Equation (jSJ). Let us now see what our model- wise expectations for this variable 
D3 = D are. 

1. In the cMSSM (cf. Figure [2]), the number of parent partons available to form final 
state jets (apart from radiated gluons) can be 2, 3 or 4 depending on whether the 
initial state is qq, qg or gg respectively, with the largest cross-sections corresponding 
to 2 nascent jets. There will be some migration of events in the direction of increasing 
jet multiplicity as a result of jet-fragmentation and some migration in the direction of 
decreasing jet multiplicity because of jet-merging. One cannot tell a priori if one will 
dominate the other without doing an actual simulation. Nevertheless, it is reasonable 
to guess that the ratio of these cross-sections (or events numbers) will be of the order 
of unity, as indeed, it is for the parameter choice of Figure HI 

2. In the UED(5) model, the argument is exactly identical with respect to parent partons, 
as the processes in question mimic the cMSSM processes almost exactly (cf. Figure [2]). 
Nevertheless, in this case, the near-degeneracy of the UED(5) spectrum causes the jets 
to be much softer than their cMSSM counterparts. In this case, the jet multiplicity 
will tend to be lower, partly because many of the jets will not be identifiable as such 
because of their extreme softness, and partly because soft jets tend to spread out more 
and merge more. We can, therefore, guess that D3 will be smaller for the UED(5) 
model than it is for the cMSSM. To make a more precise estimate requires a detailed 
simulation. 

3. In the SM(4), we have earlier seen that the parton-level count for nascent jets is 4, and 
all lower jet counts to arise from merging or jet-loss effects. Hence we expect D 3 to be 
larger than unity. Again, the exact figure will depend on the mass splittings and the 
jet identification and merging algorithms. 

4. In the LH(T) model, again, we have earlier argued that the parton-level jet count is 
just 2, and hence larger number of jets will arise only as a result of fragmentation etc. 
Thus, we expect D 3 to be very small for the LH(T). 

Of course, all these estimates must be qualified by the scatter induced by parameter variation. 
Nevertheless, as the estimates are based on event toplogy rather than details of the spectrum, 
we expect a loose hierarchy in D3 of the form Df^^ > £)cMssm > £>ued(5) ^ £)^h(t) Tjm ess 
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there is some major surprise in the trilepton signal, the experimental number is likely to 
eventually fall into one of the ranges covered by these different models. 

The behaviour and discriminating power of the _D 4 variable is rather similar, and the only 
difference lies in the fact that it corresponds to a 4 lepton trigger. This makes it truly 
different only for the SM(4) case, where now all four W bosons must decay through leptonic 
channels, and hence the parton-level expectation becomes 2 jets instead of 4. Thus, we 
expect -D4 to be small for the SM(4) unlike the D%, which is large. Correlating a small D4 
with a large D 3 is, therefore, a characteristic of the SM(4). 

We can make similar arguments for the A^ and A' e variables. Instead of elaborating further, 
we present our naive expectations for all the ratio-type variables concisely in Table 2 below. 



Variable 
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Table 2: Naive expectations for ratio-type discriminators in the four models under consideration. The 
symbols used are to be read as follows. L implies large, i.e. order 10 or more, M implies medium, i.e. order 
unity or thereabouts, and S implies small, i.e. order 0.1 or less. The empty slot arises because there are 
naively no hard jets to speak of in the LH(T) model and hence the ratio in question must depend on effects 
which cannot be predicted without carrying out a simulation. 

We can now easily discriminate between these models by combining these ratio variables 
appropriately. For example, if we wish to discriminate between the cMSSM and UED(5) 
models, we consider and the A^, A' e . If we wish to discriminate between the cMSSM 
and the SM(4), we consider the .D34 and -D34, and so on. A systematic study, considering 
the accessible parameter space in each of the four models is described in the next section. 

4 Addressing the Inverse Problem 

A statement of the full LHC Inverse Problem would be somewhat as follows: // we observe 
a deviation from the SM predictions at the LHC, how do we determine its underlying cause? 
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If it is due to new physics, how do we identify the model and determine its parameters? 
This apparently-simple question actually requires one to cover a vast canvas of facts and 
inferences, as LHC data will be somewhat limited in scope. After all, the only observables 
will be a bunch of leptons and jets and their momenta in the transverse plane, and everything 
else will have to be inferred from this data. 

In this article, we concentrate on a limited portion of the general inverse problem. Through 
the previous sections, we have described the signals with 3£ + MET + jets and A£ + MET 
+ jets in pp collisions at the LHC, which have little or no SM background. If a signal is seen 
in any of these two channels, four of the most likely new physics candidates are the models 
denoted cMSSM, UED(5), SM(4) and LH(T). The inverse problem in this case consists of 
trying to identify which of these four models is the cause of the observed signal. Of course, 
the signal, if seen, could also be due to some other model - even, in principle, to completely 
new physical. 




In the previous section, we have shown how we can define discriminating variables of two 
types, viz. event-counting type and ratio-type. The method is economical in the sense that 
the ratios are of pairs of event-counting variables and do not need separate data. In fact, the 
raw experimental data would require a trigger on leptons and MET, and the jet multiplicity 
for each event would be stored. This is a simple requirement and is certainly on the agenda 
for both the CMS and ATLAS Collaborations when the LHC becomes operational. With 
this limited data, we can construct all the variables defined in the previous section. At the 
present juncture, however, we do not have data, and in any case, we require to see how 
accurate are our guesses regarding these variables, as exhibited in Table 2. This requires a 
detailed Monte Carlo simulation of the events, which has been performed and is described 
below. 

The principal tools in our numerical analysis of the problem are 

1. The event generator CalcHEP[16j, which generates parton-level events for new physics 



2. The event generator PYTHIA[15j. which hadronises partons and creates hadron show- 
ers, collecting them into jets using a simple toy calorimeter algorithm called PYCELL, 
and, finally, 

3. The software SUSPECT[25J which creates a cMSSM spectrum from the standard input 
parameters and imposes known experimental constraints on it. 

8 Though that would come as a surprise, and is not, by its very definition, predictable. 




models 
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In general, experimental constraints have been imposed in a very conservative fashion on 
all the models in question. For example, though three of the models have dark matter can- 
didates, we have not imposed a constraint from the cosmic relic density [I2.]. Similarly, the 
LH(T) model might be severely constrained by low-energy data[26j, but this has not been 
explored in great detail yet. The philosophy adopted in this work is that we shall consider the 
maximum possible parameter ranges and see if we can avoid overlaps between the discrimi- 
nating variables for variation over these full ranges. Further constraints can only cause the 
allowed parameter space — and hence the range of the discriminating variables - to shrink 
further, thereby reducing potential overlaps. What we exhibit in this article is, therefore, 
the worst case scenario in every variable. If a positive result is obtained notwithstanding 
this, then we may be sure that it can only be further strengthened by further experimental 
constraints^. 

To get a generic trigger which will work for each of the models under consideration, we have 
imposed the following kinematic cuts on leptons for identification: 

• The transverse momentum p l T should satisfy p l T > 10 GeV. This is an acceptance cut 
required to excite showers in the electromagnetic calorimeter (EMC) for both LHC 
detectors. 

• The pseudorapidity % should satisfy T]g < 2.5. This roughly covers the barrel and end 
caps for the EMCs. 

• There should be no hadronic clusters with total energy Eh > 0.3 Eg in a cone of semi- 
vertical angle AR = 3.5 around the lepton momentum This is an isolation 
criterion. 

The criterion for labelling a lepton as 'hard' has already been described in the context of the 
definitions in Table 1. In addition to this we require the total missing transverse momentum 
p™ lss to satisfy 

p™ ss >20 GeV. (7) 

Once an event passes the above triggers, the hadronic final states are collected into jets. In 
our Monte Carlo simulations, we have used, as mentioned before, the PYCELL toy calorime- 
ter routine. To be identified as a jet, we impose the simple criteria: 

• The transverse momentum p 3 ^ should satisfy p 3 ^ 1 > 20 GeV. This is an acceptance cut 
required to excite showers in the hadron calorimeter (HCAL) for both LHC detectors. 

9 Unless, indeed, these are so strong as to rule out the model altogether 
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• The pseudorapidity T]j ei should satisfy r]j Ct < 2.5. This roughly covers the barrel and 
end caps for the HCALs. 

Energy and angular smearing are inbuilt in the PYCELL algorithm and are not imposed 
separately. Thus, we expect our simulated jets to resemble the actual jets observed at the 
LHC fairly closely. Naturally, in the LHC experiments, more sophisticated jet-identification 
techniques will be employed, but we do not expect to see dramatic deviations from our 
results. Thus, for each of the events simulated, we note the jet multiplicity, and this enables 
us, for a given luminosity, to generate the event-counting variables defined in Table 1. As 
for leptons, the criterion for labelling a jet as 'hard' has already been described. Once we 
have all this information, it is a simple matter to compute the ratios defined in Equation (jHJ), 
subject to a requirement that both numerator and denominator should be greater than 5, 
for the given luminosity. 

In order to carry out a Monte Carlo simulation, we have generated 30,000 pair-production 
events for each point in the parameter space for each of the models under consideration. 
This large number corresponds, for a cross-section of 1 pb, to a luminosity of 30 fb _1 , which 
is a conservative estimate of the maximum attainable at the LHC. Of course, for lower 
luminosities, all event-counting variables simple scale as the luminosity, unless, indeed, the 
numbers are very small and sensitive to statistical fluctuations. 

A detailed scan over the allowed parameter space for each of the four models in question 
would require an immense amount of computer time, and is probably not called for at the 
present juncture. In any case, our purpose in this article is merely to illustrate how, by 
correlating discriminator variables as defined, we can distinguish between models. To obtain 
a quick estimate of the spread in these variables as we scan the parameter space, we employ 
a standard trick, which is to make a random sampling of the parameter space over its full 
accessible range. If a sufficiently large number of such random points is chosen, we expect a 
reasonable complete sampling of the parameter space, taking care of most of the kinematic 
as well as dynamic behaviour of the particles in the model. The possibility that there exists 
an isolated, un-sampled point in the parameter space where this behaviour would be very 
different is militated against by the fairly continuous behaviour of the spectrum and coupling 
constants in the models under question. The only singular behaviour really expected arises 
from particle resonances, but these are nicely smoothed out everywhere using the Breit- 
Wigner approximation. Thus we can argue that our random sampling of the parameter 
space actually leads to a fairly good representation of the spread in discriminating variables 
induced by scanning the parameter space. Before checking on these, however, it is necessary 
to mention the actual ranges over which the parameters are varied and the experimental 
constraints thereon. 
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1. In the cMSSM, the parameters are chosen randomly within the ranges given below: 
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The lower bounds on mo and roughly correspond to the bounds from LEP-2 and 
their upper bounds represent a stage when the squarks and gluinos are too heavy to be 
produced in any useful numbers at the LHC. The latter argument also applies to the 
magnitude of Aq. The range in tan/3 starts from the LEP and Tevatron lower bound 
and is cut off roughly at the point where the top quark Yukawa coupling to the charged 
Higgs bosons would become non-perturbative. The random parameter choice within 
these limits is fed into the software SUSPECT, which generates the cMSSM spectrum 
and couplings by solving the renormalization group (RG) equations evaluated at the 
one- and two-loop levels (as appropriate). Seven standard constraints [27] are applied 
to the cMSSM parameter space as follows: 

(a) The electroweak parameter p should satisfy (A/?) c mssm < 2.2 x 10~ 3 . 

(b) The muon should satisfy -7.7 x 10~ 10 < (g - 2)^ < 4.7 x 10~ 9 . 

(c) Radiative 6-decays should satisfy 2.65 x 10~ 4 < BR(B K*^) < 4.45 x 10~ 4 . 

(d) The lightest scalar h° should satisfy M h > 93 GeV. 

(e) The light chargino xt should satisfy M~± > 104.5 GeV. 

(f) The light stop t x should satisfy M Ti > 101.5 GeV. 

(g) The light stau r x should satisfy M ?1 > 98.8 GeV. 

A further constraint on the parameter space arises from the requirement that it should 
be accessible at the LHC. Noting that a trilepton signal will call for at least a suppres- 
sion by the leptonic0 branching ratios of a W and a Z, i.e. 0.21 x 0.067 = 0.014, it is 
clear that unless the initial cross-section for pair production of squarks and gluinos is 
at least 10 fb, we will not get any signal worth analysing for 30 fb _1 of data. Obviously, 
this constraint can be applied only after calculation of the cross-sections in question. 
Fortunately, this is mostly taken care of by the upper limits chosen for m and m^j. 
Note that we have not applied the dark matter constraint which would require the ob- 
served cosmic relic density to match with its cMSSM prediction [12]. This, as explained 
before, is a conservative standpoint, and imposition of the dark matter constraint can 
only shrink the cMSSM parameter space further. 

Recall that at the LHC we trigger only on electrons and muons, and not on taus. 
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Having decided on a viable point in the parameter space, then, the corresponding spec- 
trum and couplings are generated by the SUSPECT codes and read in by the PYTHIA 
software, which is geared to 'generate' gluino and squark pairs and simulate their cas- 
cade decays. Only those 'events' are retained where the final state satisfies the 10 fb 
constraint mentioned above and contains 3 or 4 identifiable leptons as well as missing 
Pt, as per the criteria given above. For these events, the jet multiplicity is counted. 
This enables us to classify the events into appropriate bins, whose accumulated num- 
bers enable us to calculate the variables defined in Table 1 and Equation (jBJ). 

2. For the UED(5) model, there are only two parameters, viz. the size R of the extra 
dimensions (usually represented by its inverse -R -1 , and the cutoff A for the theory, 
usually scaled as AR. We have chosen the points randomly in the ranges 

300 GeV < iT 1 < 1.5 TeV 5 < AR < 20 

which are fairly conservative bounds for the UED(5) model. The lower bound[18] of 
300 GeV on R~ x is obtained by stretching the Tevatron constraint to its 5a limit, while 
the upper bound of 1.5 TeV is an accessibility limit as it more or less corresponds to 
cross-sections for g x and gi pair-production below the 10 fb limit explained above. The 
requirement that A should remain above the LHC-accessible region determines the 
lower bound of 5 on AR, while the upper bound is determined by the requirement that 
the cutoff should not create a new and unnatural hierarchy of scales in the UED(5) 
model. Of course, both the upper and lower bounds 5 and 20 are ballpark numbers, 
which we use to estimate the parameter spread - they could very well have been 
replaced by 4 and 22, for example (but not by, say, 2 and 30). 

Once the point in the UED(5) parameter space is chosen, the couplings and parton- 
level cross-sections for production of q±qi, qig± and g±g± pairs are calculated using the 
event generator CalcHEP, which has the provision for inclusion of new physics beyond 
the Standard Model. The parton-level momenta are fed into PYTHIA, using the 
LHEF formatting [28], and further development of hadronic final states is carried out by 
PYTHIA. The triggering, counting of jet multiplicity and calculation of discriminating 
variables then proceeds exactly as in the cMSSM case. 

3. The analysis for the LH(T) model is done very similarly to the analysis in the UED(5) 
case. Here there are three parameters, viz, the symmetry-breaking scale / and the 
Yukawa couplings n q and K£. The ranges over which these are permitted to vary are 

500 GeV < / < 1.5 TeV , 0.25 < n q < 1 , 0.25 < n t < 1 . 

The lower bound on / is induced by the lower bound M(Bt) > 80 GeV which arises 
from precision electroweak measurements [29] . The upper bound is an accessibility 
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limit, since it more or less corresponds, as in the UED(5) case for -R -1 , to a situation 
where the cross-section for qx pair-production falls below the 10 fb limit. Of course, 
every random choice of parameters in the LH(T) parameter space described above 
does not lead to trilepton and four-lepton signals at the LHC. The masses must be 
compatible with Equation (j3J) in order to see such signals. Only those points which 
lead to such a mass hierarchy are, therefore, selected out of a large random sample. 
Once this has been done, the event generation can be carried out by a CalcHEP- 
PYTHIA combination through an LHEF interface [30J, as in the case of the UED(5) 
model. 

4. The simplest parameter space is the case of the SM(4), where we simply allow M(b±) 
to vary between 350 GeV and 600 GeV, i.e. between the experimental lower bound 
and the theoretical upper bound where the corresponding Yukawa coupling would lead 
to a Landau polejll]. The mass of the £4, the other free parameter of the theory, is 
not relevant for our analysis, as has been explained before. The simulation of events in 
the SM(4) is done using PYTHIA, more or less in the same way as the previous cases. 
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Figure 5: Correlation plots between event counting-type variables (a) N% against N^ 3 \ and (b) 
against N^ 3 \ The box drawn with broken lines shows the reach with a luminosity of 1 fb^ 1 , for which the 
axis labels should be scaled by a factor of 0.01 on either axis. These numbers all correspond to y/s = 10 TeV. 

We are now in a position to exhibit and discuss our results. The simplest way to discrimini- 
nate between models is to correlate a pair of event counting-type variables. Several of these 
are defined in Table 1, and can be plotted against each other. The best results arise when 
we plot (a) against , and (b) against . These are exhibited in Figure [5j 
where the left box corresponds to the option (a) and the right box to the option (b). This 
correlation plot is a two-dimensional example of a 'signature space' defined in Ref. [13]. We 
may note that the variable D 3 is the slope of the graph in the box (a). 
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There are two nested boxes in Figure [51 The outer one, which corresponds to a rather 
optimistic projection of an integrated luminosity of 100 fb" 1 , is correctly labelled on the two 
axes. The inner one, two sides of which are represented by broken lines, indicates what would 
be obtained with an integrated luminosity of 1 fb _1 , i.e. an early result. For the 1 fb -1 - 
option, the labels on the two axes must be scaled by a factor of j^. Within the boxes, 
clusters of points are colour-coded to represent the different models, but are also labelled 
alongside and indicated with arrows to avoid confusion in a black-and-white printout. Each 
point of a given colour on the graph represents a point in the parameter space. The clusters 
of points of a given colour represent the Monte Carlo equivalent of a filled region of parameter 
space, and hence the rough boundaries of each cluster may be regarded as an approximate 
contour delineating the region predicted by a specific model on the plot in question. 

What are the conclusions one can draw from Figure EP If one considers the left box, where 
Nr, is plotted against N$ , it is clear that there are two distinct regions - one which is pop- 
ulated by the slightly-overlapping UED(5) and LH(T) clusters, and one which is populated 
by the overlapping cMSSM and SM(4) clusters. The distance between these two regions is 
not small if we note that this is a logarithmic plot. Thus, the qualitative results will not 
change even if we see fairly substantial corrections due to PDFs, radiative effects, etc. The 
reality, of course, should ideally correspond to a single point in this graph, but in practice, 
we will get a square blob due to experimental errors. There are now three possibilities, each 
calling for a separate comment. 

• If the experimental blob should lie squarely in one of the regions where just one of the 
four models is unambiguously indicated, this will provide an immediate solution to the 

(3) (3) 

inverse problem for this signal. For example, if N% = 100 ± 10 and iV 2 = 10 ± 3, the 
cMSSM must be the new physics, and the other three models are ruled out. 

• If the experimental blob lies in or near the overlap region of two of the models in 
question, one cannot distinguish between these two models (by looking only at this 
graph), but the other two models get ruled out. For example, if = 1000 ± 32 
and iV 2 (3) = 20000 ± 140, then the new physics may be either the UED(4) or LH(T) 
models, but the cMSSM and the SM(4) are definitely ruled out. The channel separating 
the cMSSM and the UED(5), which separates the clusters into two distinct parts, is 
sufficiently wide so that even with an error oc yN, an experimental blob lying close 
to one of the regions cannot have an overlap with the other region. Thus, we may be 
sure that at least two of the models will always be eliminated. 

(3) 

• If the experimental blob lies away from all the clusters, e.g. we have N% = 10000 ±100 

(3) 

and iV 2 = 10 ± 3, then we will have a signal for new physics, indeed, but it can 
correspond to none of the models under consideration. In that case, we would not 
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have a solution for the inverse problem, but will have the somewhat cold consolation 
that at least four of the most popular models are ruled out. 

The purposes of this work are served if we consider the first two possibilities; the third 
requires an altogether different kind of study. If the first should happen, we can consider 
ourselves lucky and treat the subsequent discussion as a method to shore up the conclusions 
already reached. If the second case should happen, which is equally probable, we now require 
to see how one can further disentangle the models by looking at other variables. The first 
thing to do is to look at other event-counting variables. The best combination is exhibited 
in the right box in Figure El where an trileptonic variable and a four-leptonic variable are 
plotted against each other. As we have already discussed, four-lepton signals show more 
variation in the jet complement than trilepton signals, especially for the SM(4), and hence, 
a better discrimination may be obtained by using a four-leptonic variable. This, is, in fact, 
apparent when we look at the plot of against . Not only is the channel between the 
cMSSM and UED(5) clusters wider and cleaner, but now the overlap between the UED(5) 
and LH(T) clusters is very small, and similarly, the narrow band representing the SM(4) has 
almost disengaged itself from the cMSSM cluster. The possibility of having an unambiguous 
verdict from the experimental data is, therefore, somewhat better in this plot than in the 
one on the left. However, it has to be admitted that complete disambiguation cannot be 
done by considering these two figures, either in isolation, or taken together. 

We take note of the fact that if the experimental signal is large enough to be seen even with 
1 fb _1 of data, the discrimination properties of the plots in Figure |5] (or the lack thereof) 
remain undiminished. Increase in the luminosity will make more of the parameter space 
accessible (except for the SM(4) case), but in separate patches outside the box with broken 
lines, where there is practically no overlap, except for a few outlying points. 

The option of using only event counting-type variables having achieved partial, but not 
complete, success indicates that we should next try to combine event counting-type variables 
with ratio-type variables to get a better discriminatory power. Here the best options are 
the plots shown in Figure [6l where the upper boxes have the ratio A^ plotted against the 
number N!f\ and the lower boxes show the same ratio A^ plotted against the number . 
In each case, the box on the left shows the prediction for a luminosity of 1 fb , i.e. the 
early data at the LHC, while the box on the right corresponds to 30 fb -1 , which may well be 
the final data sample at the LHC. Note that there is no real qualitative differences between 
the two luminosity options, except that in the high luminosity case, many more points in 
the parameter space are accessible, which is as expected. 

If we recall that the problem in Figure [5] was that of separating the cMSSM-SM(4) and 
UED(5)-LH(T) cluster pairs, then Figure [6] is very interesting. Obviously, the UED(5) 
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Figure 6: Correlation plot between event counting-type variables versus a ratio-type variable for ^/s = 

(3) 

10 TeV. The upper boxes have the ratio Af plotted against the number , and the lower boxes show the 
same ratio Af plotted against the number ■ In each case, the box on the left (right) shows the prediction 
forC = l fb- 1 (30 fb- 1 ). 



and LH(T) clusters are now widely separated, with A^ playing the role of a single-variable 
discriminator. If we look at the definition of A^ in Equation ([6]) and Table 1, it is clear that 
this is simply related to the fact that in the LH(T) model we will almost always have a hard 
lepton, while in the UED(5) model, that is a rarity. At the same time, there is also a clear 
separation of the SM(4) cluster from the cMSSM cluster, except for some outlying cMSSM 
points which appear for the higher luminosity. Even so, the separation between these clusters 
in the plot in the lower boxes is quite substantial - this is not unexpected, since the abscissa 
is a four-lepton variable, where the SM(4) has a clear bias towards a smaller number of jets. 
Thus, if we combine the information from Figure [5] as well as Figure El it is clear that we will 
know quite clearly which of the four models under study is the cause of an excess leptonic 
signal. 
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Figure 7: Correlation plot between ratio-type variables for ^fs = 10 TeV. The upper boxes have the ratio 
D4 plotted against D%, and the lower boxes show the ratio -D4 plotted against the same variable D3. In each 
case, the box on the left (right) shows the prediction for C — 1 fb~ x (30 fb~ x ). 



Our analysis could have stopped at this point, since the goal of disentangling signals for each 
of the four models in question has been achieved. However, we have even nicer results when 
we correlate pairs of ratio-type variables as defined in Equation ([6]). In Figure [7] we plot the 
ratios D3 and D4 against each other in the upper half, and the ratios D3 and -D4 in the lower 
half, for luminosities of 1 fb _1 and 30 fb _1 on the left and right respectively. A glance at the 
figure will show that we have four clearly delineated clusters of points corresponding to the 
four models in question. We note that these are logarithmic plots, and hence, what looks like 
a small separation between, say the cMSSM cluster and the SM(4) cluster, is actually quite 
significant. Thus, if signals are seen at the lower luminosity of 1 fb _1 , clearly, any one of the 
plots should achieve, at one go, a clear disambiguation between all four models. If signals 
appear later, when 30 fb _1 have been collected, there may be some difficulty in separating 
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the SM(4) from the cMSSM, as the clusters are quite close. We have already discussed why 
the clusters grow in size when the luminosity increases. 

We can make several plots of similar nature, for example, by taking the four-lepton variable 
-D4 and plotting it severally against A^ and A[. Most of these plots show the following 
common features: 

• Each of the models results in a separate cluster of points, which is somewhat scattered 
for 1 fb^ 1 but compact and dense for 30 fb _1 . 

• In each case, there is a small overlap or proximity between just one pair of the models 
in question, others being widely separated. 

• In general, the hardest to separate seem to be the cMSSM cluster and the SM(4) 
cluster. 

The last problem is not insurmountable, however, as we have already seen in the context of 
Figure El This finds its neatest manifestation in Figure [HI where we plot D4 against A^ and 
A' e respectively. A glance at Figure M will show that there is now a clear separation between 
the (blue) SM(4) cluster and the (red) cMSSM cluster. Experimental errors will be too small 
to permit any experimental blob to span this gulf. Thus, even if all else fails, this plot will be 
enough to distinguish between the SM(4) and cMSSM cases. The plot is also good enough 
to distinguish the UED(5) uniquely, since the corresponding points form another isolated 
cluster. What it fails to do is to completely separate the SM(4) from the LH(T) model, but 
that is not a cause of worry, since these two models lead to widely separated clusters (by 
more than two orders of magnitude in D 3 ) in Figure [71 

We can thus present a clear and unambiguous prescription for solving the LHC inverse 
problem in trilepton and four-lepton signals. All that requires to be done is to 

1. trigger on events with 3£ + MET with or without accompanying jets, as well as on 
events with A£ + MET with or without accompanying jets; 

2. using a suitable jet counting algorithm, determine the jet multiplicity for each of the 
triggered events, and use this information to populate jet-multiplicity distributions for 
both kinds of signal; 

3. calculate the event counting-type variables of Table 1, using the jet-multiplicity distri- 
butions and simple variables like lepton and jet px', 

4. calculate the ratio- type variables defined in Equation ([6]); 
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5. make correlation plots as shown in this article: the solution to the inverse problem will 
leap out of the figure. 



1 fb 
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Fi gure 8: Correlation plot between ratio-type variables for y/s = 10 TeV. The upper boxes have the ratio 
Z?4 plotted against Ag, and the lower boxes show the same ratio plotted against A' e . As before, the box 
on the left (right) shows the prediction for L = 1 fb -1 (30 fb~ x ) in each case. 

Before concluding this section, it is relevant to say that the above prescription does not 
require any real extra effort on the part of our experimentalist colleagues over and above 
what they would be doing anyway when the LHC data arrives. Trilepton and four-lepton 
signals with MET are at the forefront of the new physics programme at the LHC in any 
case, and it should require a small modification of the usual analysis to achieve the results 
projected in the present work. 
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5 Critical Summary 



This work has grown out of the oft-quoted statement that trilepton and four-lepton signals 
(accompanied by MET and jets) could be like 'smoking gun' signals of supersymmetry when 
sufficient data are accumulated at the LHC. Such statements beg the retort that the same 
signals could be produced by other models as well, and in this paper, we have considered 
three popular alternatives, viz, the UED(5) or 'bosonic supersymmetry', the LH(T) and 
the simplest extension of the SM, which involves adding a fourth sequential generation of 
fermions. We have explained at length how this happens, what are the decay chains respon- 
sible and what kind of parameters one requires to have a competitor for supersymmetry, 
embodied as the cMSSM. Our analysis has been carried out as objectively as possible, i.e. 
without any bias in favour of or against supersymmetry or any other model. 

The major points made in the article are as follows. We first note that at the LHC, the most 
viable signatures for new physics beyond the SM will occur when we have heavy particles 
produced through strong interactions, but decaying into leptonic final states. This can 
happen only if there is a conserved (or nearly conserved) quantum number), and three of the 
four models (cMSSM, UED(5) and LH(T)) do have such a quantum number corresponding 
to a discrete Z 2 symmetry in the theory. This also makes the lightest particle carrying a 
nontrivial Z 2 quantum number a stable particle and hence a dark matter candidate. Thus 
models of this nature are important to investigate. We focus on final states with 3 or 4 
leptons, missing transverse momentum and an unspecified number of jets. These are shown 
(Figure [2]) to arise in all of the models in question, which brings us to an inverse problem - if 
we do see a signal of this nature, how do we know which of the models is the correct one? We 
then show (Figure HI) that the lepton px is not enough to distinguish between these models, 
because it can look identical for certain parameter choices of the models. However, for the 
same choices, the jet multiplicity distributions look very different. Using this hint and some 
general arguments based on the event topology in the individual models, we then go on to 
define a set of discriminating variables (Table 1 and Equation ([6])). Finally we do a numerical 
survey of the models in question, creating a cluster of points in each discriminating variable 
by random sampling of points in the parameter space. It is seen that by plotting suitable 
pairs of the discriminating variables against each other (Figures [5HH]), the four models can 
generally be made to fall into distinct regions of the diagram, with some small degree of 
overlap. The latter is not a serious problem, because models which show overlap for one 
pair of variables get widely separated for a different set of variables. Hence, we arrive at a 
rather elegant way of solving the LHC inverse problem in trilepton and four-lepton signals 
(accompanied by MET and jets) - there will be only one point (with error bars) in each of 
these planes when the experimental data are available, and one only has to see where it lies 
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to get a spectacular solution to the inverse problem. 

Our work and what it hopes to achieve is summed up in the previous paragraph. To put this 
in perspective, we now come to some of the issues which our work does not address. Apart 
from technical details, such as the quality of Monte Carlo simulation and jet reconstruction 
- where we feel confident that our results, though not as refined as one might have wished, 
have not gone in the wrong direction - there are two major scenarios where our work could 
prove inadequate. 

1. Clearly, in order to identify a particular model as the new physics source, the exper- 
imental data should map, in each one of the Figures EHEJ to points which lie in the 
region corresponding to a particular model. Thus, if the LH(T) is the correct option, 
the experimental blob described in the last section should, in every case, lie within 
the (violet) cluster of points indicating the LH(T). Deviations from this can happen 
either (a) when the experimental blob matches with the LH(T) in some diagrams, but 
not in others, or (b) when the experimental blob always lies outside the area of scatter 
corresponding to all the four models under consideration. Any deviation of this nature 
must be interpreted as due to a different kind of new physics. Even a single mismatch 
will indicate that new theoretical ideas (or at least some modification of these models) 
are required. 

2. The requirement that the experimental blob should lie in the appropriate region of pa- 
rameter space is a necessary condition to identify a certain model, but not a sufficient 
one. For example, if in all the diagrams, the data match with the cMSSM patches, 
then we may say that supersymmetry is strongly indicated, but it does not necessarily 
have to be the constrained version which we have been using in this analysis. Super- 
symmetric models which relax the universality assumptions of the cMSSM, or even the 
124-parameter MSSM with no universality or unification assumptions, are not ruled 
out by this test alone. 

In fact, as mentioned in an earlier section, this article discusses four of the popular con- 
structions beyond the minimal SM which could lead to trilepton and four-lepton signals at 
the LHC. There are other possibilities, which have not been touched upon in this paper. 
Though one can provide arguments against such options (for example, i?-parity-violating 
supersymmetry is unpopular because it does not provide a dark matter candidate), Nature 
has surprised us before and may do so again. Thus, we can only claim to have set out a 
neat technique to approach the inverse problem. We hope that future studies of new physics 
models in the trilepton and four-lepton channels will find this a useful framework in which 
to present the corresponding results. 
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Having mentioned the caveats and worst case scenarios, we now take a positive approach 
and see what will happen if, indeed, the experimental blobs correspond, in each diagram, to 
the cluster corresponding to a particular model. This will immediately rule out the three 
other models^, but it will not tell us which part of the parameter space of the successful 
model is responsible for the signal. However, one can then do a fine-grained parameter 
scan (systematic or random), for the entire set of discriminating variables, and see which 
point (s) in the parameter space can fit the whole set of experimental data. This, rather 
computer-intensive, study can be done only when there is actual data, but it has the merit 
of determining all the parameters of the model together. Thus, if we can find a fitting point, 
the entire mass spectrum and the couplings are determined by the theoretical structure of 
the model. 



In conclusion, therefore, we have studied a particular aspect of the LHC inverse problem 
in a more intensive way than general studies of the inverse problem have done previously. 
We have devised a set of rather robust kinematic variables and set up a set of correlation 
plots which could lead to spectacular solutions of the inverse problem when the LHC data 
become available. Our work has the advantages of simplicity and economy, since we only 
suggest the use of data which will surely be collected by the ATLAS and CMS experiments, 
and we use only number-counting variables, which are more robust than others against 
corrections due to smearing and other effects. We, therefore, conclude this article with a 
hope that the techniques suggested here will find favour in the community as a simple and 
direct approach to the inverse problem at the LHC - a problem which is likely to assume 
paramount importance in a year or two from the present. 
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n This will, of course, apply only to the minimal version of these models. Exotic extensions of any model 
can probably be created d posteriori to spread the corresponding cluster to the experimental blob. 
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